Application of the bispectrum to glottal pulse analysis

Author

  • Jacqueline Walker
Abstract

Higher-order spectral (HOS) techniques, such as the bispectrum, offer robustness to Gaussian noise and the ability to recover phase information. However, their drawbacks, such as the high variance of estimates and the need for long data records, have limited their use in conventional speech processing applications. Because all existing inverse filtering approaches to glottal pulse estimation use second-order statistics, it is of interest to explore the potential of HOS in this area. Using the theory of HOS factorization and the linear bispectrum, it is shown how voiced speech can be modelled as a system driven by non-Gaussian coloured noise. The linear bispectrum approach can be used to obtain alternative glottal pulse and vocal tract estimates in hybrid Iterative Adaptive Inverse Filtering (hIAIF), and the results are compared with traditional IAIF. Finally, a new technique is presented that involves joint estimation of the glottal pulse and vocal tract followed by inverse filtering. This new technique shows good preliminary results and is much simpler than previous techniques.
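
For orientation, the bispectrum mentioned in the abstract can be estimated by averaging the triple product X(f1) X(f2) X*(f1 + f2) over data segments. The sketch below is a conventional direct (FFT-based) estimator, not the linear bispectrum or hIAIF procedure of the paper; the function name `bispectrum_direct`, the segment length `nfft`, and the Hann window are illustrative assumptions. Because the estimate is an average over segments, it also hints at why long data records are needed to bring the variance down.

```python
import numpy as np


def bispectrum_direct(x, nfft=64):
    """Direct (FFT-based) bispectrum estimate, averaged over segments.

    Illustrative sketch only: non-overlapping segments, Hann window,
    no frequency-domain smoothing and no normalisation.
    """
    x = np.asarray(x, dtype=float)
    n_seg = len(x) // nfft
    if n_seg == 0:
        raise ValueError("signal is shorter than one segment")

    window = np.hanning(nfft)
    idx = np.arange(nfft)
    # Lookup table for the sum frequency (f1 + f2), wrapped modulo nfft.
    sum_idx = (idx[:, None] + idx[None, :]) % nfft

    B = np.zeros((nfft, nfft), dtype=complex)
    for k in range(n_seg):
        seg = x[k * nfft:(k + 1) * nfft]
        seg = seg - seg.mean()          # remove the local mean
        X = np.fft.fft(seg * window)
        # Triple product X(f1) X(f2) conj(X(f1 + f2)) for every bifrequency.
        B += X[:, None] * X[None, :] * np.conj(X[sum_idx])
    return B / n_seg


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 64 * 1024
    gaussian = rng.standard_normal(n)          # symmetric, zero skewness
    skewed = rng.exponential(1.0, n) - 1.0     # zero mean, positive skewness
    print("Gaussian input:", abs(bispectrum_direct(gaussian).mean()))
    print("Skewed input:  ", abs(bispectrum_direct(skewed).mean()))
```

Run as a script, this compares the average bispectrum level of Gaussian noise with that of skewed (non-Gaussian) noise of the same variance. The Gaussian level should shrink toward zero as more segments are averaged, which is the noise-robustness property the abstract refers to.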

Similar resources

Steady Flow Through Modeled Glottal Constriction

The airflow in the modeled glottal constriction was simulated by the solutions of the Navier-Stokes equations for laminar flow, and the corresponding Reynolds equations for turbulent flow in generalized, nonorthogonal coordinates using a numerical method. A two-dimensional model of laryngeal flow is considered and aerodynamic properties are calculated for both laminar and turbulent steady flows...

Acoustical Study on Sub-harmonic of Glottal Source in Mandarin Tones (Jiangping Kong, Dept. of Chinese Language and Literature)

This paper presents an acoustical analysis of sub-harmonics of the glottal source in Mandarin tones. The methods used in this research are: 1) extracting the glottal source of tones by inverse filtering; 2) analyzing sub-harmonics and spectral tilt by FFT; 3) simulating the double-peak pulse with 4 functions and describing their nature in both the time and frequency domains. There are 3 conclus...

Glottal area patterns in numerically simulated diplophonia

The presentation explores diplophonia via numerical simulations of glottal vibrations. The aim of the study is to improve the understanding of glottal wall vibration and area waveform patterns of nonmodal phonation as observed via laryngeal high-speed videos. Diplophonia has been described as the simultaneous perception of two pitches during voicing. A broader definition is the vibration of diff...

Vocal quality factors: analysis, synthesis, and perception.

The purpose of this study was to examine several factors of vocal quality that might be affected by changes in vocal fold vibratory patterns. Four voice types were examined: modal, vocal fry, falsetto, and breathy. Three categories of analysis techniques were developed to extract source-related features from speech and electroglottographic (EGG) signals. Four factors were found to be important ...

Introduction of low to high frequencies bispectrum rate feature for deep sleep detection from awakening by electroencephalogram

Background: Accurate detection of deep sleep (also called slow-wave sleep because of the low frequency of the brain signal in this stage) from awakening increases sleep staging accuracy, an important factor in medicine. Given the time and cost of manually determining the depth of sleep, we can automatically determine the depth of sleep by electroencephalogram (EEG) signa...

Publication date: 2003